The Use of Bayesian Networks to Assess the Quality of Evidence from Research Synthesis: 2. Inter-Rater Reliability and Comparison with Standard GRADE Assessment

نویسندگان

Alexis Llewellyn

Craig Whittington

Gavin Stewart

Julian PT Higgins

Nick Meader

Neil R. Smalheiser

چکیده

BACKGROUND The grades of recommendation, assessment, development and evaluation (GRADE) approach is widely implemented in systematic reviews, health technology assessment and guideline development organisations throughout the world. We have previously reported on the development of the Semi-Automated Quality Assessment Tool (SAQAT), which enables a semi-automated validity assessment based on GRADE criteria. The main advantage to our approach is the potential to improve inter-rater agreement of GRADE assessments particularly when used by less experienced researchers, because such judgements can be complex and challenging to apply without training. This is the first study examining the inter-rater agreement of the SAQAT. METHODS We conducted two studies to compare: a) the inter-rater agreement of two researchers using the SAQAT independently on 28 meta-analyses and b) the inter-rater agreement between a researcher using the SAQAT (who had no experience of using GRADE) and an experienced member of the GRADE working group conducting a standard GRADE assessment on 15 meta-analyses. RESULTS There was substantial agreement between independent researchers using the Quality Assessment Tool for all domains (for example, overall GRADE rating: weighted kappa 0.79; 95% CI 0.65 to 0.93). Comparison between the SAQAT and a standard GRADE assessment suggested that inconsistency was parameterised too conservatively by the SAQAT. Therefore the tool was amended. Following amendment we found fair-to-moderate agreement between the standard GRADE assessment and the SAQAT (for example, overall GRADE rating: weighted kappa 0.35; 95% CI 0.09 to 0.87). CONCLUSIONS Despite a need for further research, the SAQAT may aid consistent application of GRADE, particularly by less experienced researchers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing the Validity and Reliability of the Persian Version of the Interpersonal Problem Solving Skills Assessment Tool in Schizophrenia

Objective: This study aimed to translate the Assessment of Interpersonal Problem-Solving Skills (AIPSS) into Persian and to evaluate the validity and reliability of the Persian version of AIPSS to use for adults with schizophrenia. Materials & Methods: In this methodological study, the translation process was performed according to the International Quality of Life Assessment (IQOLA) protocol....

متن کامل

Towards a Task-Based Assessment of Professional Competencies

Performance assessment is exceedingly considered a key concept in teacher education programs worldwide. Accordingly, in Iran, a national assessment system was proposed by Farhangian University to assess the professional competencies of its ELT graduates. The concerns regarding the validity and authenticity of traditional measures of teachers' competencies have motivated us to devise a localized...

متن کامل

Test-Retest and Inter-Rater Reliability Study of the Schedule for Oral-Motor Assessment in Persian Children

Objectives: Reliable and valid clinical tools to screen, diagnose, and describe eating functions and dysphagia in children are highly warranted. Today most specialists are aware of the role of assessment scales in the treatment of affected individuals. However, the problem is that the clinical tools used might be nonstandard, and worldwide, there is no integrated assessment performed to assess ...

متن کامل

Validity and Reliability Assessment of the Persian Version of Therapy-Related Symptom Checklist

AbstractTherapy-related symptom checklist for children (TRSC-C) was developed as a symptom assessment tool in children receiving chemotherapy. The objective of the present study was to evaluate the validity and reliability of the Persian version of TRSC-C. This cross-sectional study was conducted in 2013-2014 in Tehran, Iran. TRSC-C was translated using backward-forward approach. The content va...

متن کامل

بررسی پایایی بین دو آزمونگر و پایایی یک آزمونگر تست بریف بست در ارزیابی تعادل بیماران مبتلا به سکته مغزی: گزارش کوتاه

Background: Impaired balance is one of the most common symptoms that occur after stroke. There are several tests for evaluating balance in neurological disorders. Brief-balance evaluation systems test (Brief-BESTest) is the short version of BESTest that assess the systems contributing to postural control. The purpose of this study was to investigate the inter- and intra-rater reliability of the...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 10 شماره

صفحات -

تاریخ انتشار 2015

The Use of Bayesian Networks to Assess the Quality of Evidence from Research Synthesis: 2. Inter-Rater Reliability and Comparison with Standard GRADE Assessment

نویسندگان

چکیده

منابع مشابه

Assessing the Validity and Reliability of the Persian Version of the Interpersonal Problem Solving Skills Assessment Tool in Schizophrenia

Towards a Task-Based Assessment of Professional Competencies

Test-Retest and Inter-Rater Reliability Study of the Schedule for Oral-Motor Assessment in Persian Children

Validity and Reliability Assessment of the Persian Version of Therapy-Related Symptom Checklist

بررسی پایایی بین دو آزمونگر و پایایی یک آزمونگر تست بریف بست در ارزیابی تعادل بیماران مبتلا به سکته مغزی: گزارش کوتاه

عنوان ژورنال:

اشتراک گذاری